<110> APPLICANT: RANGANATHAN, ANAND

<120> TITLE OF INVENTION: DNA MANIPULATION METHODS AND APPLICATIONS FOR SYNTHETIC 
ENZYMES 

<130> FILE REFERENCE: SHW-009US
<140> CURRENT APPLICATION NUMBER: 10/009,873 
<141> CURRENT FILING DATE: 2002-11-1 
<150> PRIOR APPLICATION NUMBER: PCT/GB00/02286
<151> PRIOR FILING DATE: 2000-06-12 
<150> PRIOR APPLICATION NUMBER: GB 9913694.7 
<151> PRIOR FILING DATE: 1999-06-11 
<160> NUMBER OF SEQ ID NOS: 30
<170> SOFTWARE: PatentIn Ver. 3.3
<210> SEQ ID NO: 1 
<211> LENGTH: 12 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

oligonucleotide 
<400> SEQUENCE: 1

aaatctagac cc 12



<210> SEQ ID NO: 2 
<211> LENGTH: 12 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

oligonucleotide 
<400> SEQUENCE: 2 

agatctagac cc 12 
<210> SEQ ID NO: 3 
<211> LENGTH: 374
<212> TYPE: PRT 

<213> ORGANISM: Streptomyces lividans 
<400> SEQUENCE: 3 

55 Met Ala Gly Thr Asp Arg Glu Lys Ala Leu Asp Ala Ala Leu Ala Gln

56 1 5 10 15

58 Ile Glu Arg Gln Phe Gly Lys Gly Ala Val Met Arg Met Gly Asp Arg

59 20 25 30

61 Thr Asn Glu Pro Ile Glu Val Ile Pro Thr Gly Ser Thr Ala Leu Asp

62 35 40 45

64 Val Ala Leu Gly Val Gly Gly Ile Pro Arg Gly Arg Val Val Glu Val

65 50 55 60 






RAW SEQUENCE LISTING DATE: 04/07/2006 

PATENT APPLICATION: US/10/009,873 TIME: 08:31:48 



Input Set : A:\Sequence Listing.TXT 
Output Set: N:\CRF4\04072006\J009873.raw 



67 Tyr Gly Pro Glu Ser Ser Gly Lys Thr Thr Leu Thr Leu His Ala Val

68 65 70 75 80

70 Ala Asn Ala Gln Lys Ala Gly Gly Gln Val Ala Phe Val Asp Ala Glu

71 85 90 95

73 His Ala Leu Asp Pro Glu Tyr Ala Lys Lys Leu Gly Val Asp Ile Asp

74 100 105 110

76 Asn Leu Ile Leu Ser Gln Pro Asp Asn Gly Glu Gln Ala Leu Glu Ile

77 115 120 125

79 Val Asp Met Leu Val Arg Ser Gly Ala Leu Asp Leu Ile Val Ile Asp

80 130 135 140

82 Ser Val Ala Ala Leu Val Pro Arg Ala Glu Ile Glu Gly Glu Met Gly

83 145 150 155 160

85 Asp Ser His Val Gly Leu Gln Ala Arg Leu Met Ser Gln Ala Leu Arg

86 165 170 175

88 Lys Ile Thr Ser Ala Leu Asn Gln Ser Lys Thr Thr Ala Ile Phe Ile

89 180 185 190

91 Asn Gln Leu Arg Glu Lys Ile Gly Val Met Phe Gly Ser Pro Glu Thr

92 195 200 205

94 Thr Thr Gly Gly Arg Ala Leu Lys Phe Tyr Ala Ser Val Arg Leu Asp

95 210 215 220

97 Ile Arg Arg Ile Glu Thr Leu Lys Asp Gly Thr Asp Ala Val Gly Asn

98 225 230 235 240

100 Arg Thr Arg Val Lys Val Val Lys Asn Lys Val Ala Pro Pro Phe Lys

101 245 250 255

103 Gln Ala Glu Phe Asp Ile Leu Tyr Gly Gln Gly Ile Ser Arg Glu Gly

104 260 265 270

106 Gly Leu Ile Asp Met Gly Val Glu Asn Gly Phe Val Arg Lys Ala Gly

107 275 280 285

109 Ala Trp Tyr Thr Tyr Glu Gly Asp Gln Leu Gly Gln Gly Lys Glu Asn

110 290 295 300

112 Ala Arg Asn Phe Leu Lys Asp Asn Pro Asp Leu Ala Asn Glu Ile Glu

113 305 310 315 320

115 Lys Lys Ile Lys Gln Lys Leu Gly Val Gly Val His Pro Glu Glu Ser

116 325 330 335

118 Ala Thr Glu Pro Gly Ala Asp Ala Ala Ser Ala Ala Pro Ala Asp Ala

119 340 345 350

121 Ala Pro Ala Val Pro Ala Pro Thr Thr Ala Lys Ala Thr Lys Ser Lys

122 355 360 365

124 Ala Ala Ala Ala Lys Ser

125 370

128 <210> SEQ ID NO: 4 

129 <211> LENGTH: 372 

130 <212> TYPE: PRT 

131 <213> ORGANISM: Streptomyces ambofaciens 

133 <400> SEQUENCE: 4 

134 Met Ala Gly Thr Asp Arg Glu Lys Ala Leu Asp Ala Ala Leu Ala Gln

135 1 5 10 15

137 Ile Glu Arg Gln Phe Gly Lys Gly Ala Val Met Arg Met Gly Asp Arg

138 20 25 30 

140 Ser Lys Glu Pro Ile Glu Val Ile Pro Thr Gly Ser Thr Ala Leu Asp

141 35 40 45

143 Val Ala Leu Gly Val Gly Gly Leu Pro Arg Gly Arg Val Ile Glu Val

144 50 55 60

146 Tyr Gly Pro Glu Ser Ser Gly Lys Thr Thr Leu Thr Leu His Ala Val

147 65 70 75 80

149 Ala Asn Ala Gln Lys Ala Gly Gly Gln Val Ala Phe Val Asp Ala Glu

150 85 90 95

152 His Ala Leu Asp Pro Glu Tyr Ala Gln Lys Leu Gly Val Asp Ile Asp

153 100 105 110

155 Asn Leu Ile Leu Ser Gln Pro Asp Asn Gly Glu Gln Ala Leu Glu Ile

156 115 120 125

158 Val Asp Met Leu Val Arg Ser Gly Ala Leu Asp Leu Ile Val Ile Asp

159 130 135 140

161 Ser Val Ala Ala Leu Val Pro Arg Ala Glu Ile Glu Gly Glu Met Gly

162 145 150 155 160

164 Asp Ser His Val Gly Leu Gln Ala Arg Leu Met Ser Gln Ala Leu Arg

165 165 170 175

167 Lys Ile Thr Ser Ala Leu Asn Gln Ser Lys Thr Thr Ala Ile Phe Ile

168 180 185 190

170 Asn Gln Leu Arg Glu Lys Ile Gly Val Met Phe Gly Ser Pro Glu Thr

171 195 200 205

173 Thr Thr Gly Gly Arg Ala Leu Lys Phe Tyr Ala Ser Val Arg Leu Asp

174 210 215 220

176 Ile Arg Arg Ile Glu Thr Leu Lys Asp Gly Thr Asp Ala Val Gly Asn

177 225 230 235 240

179 Arg Thr Arg Val Lys Val Val Lys Asn Lys Val Ala Pro Pro Phe Lys

180 245 250 255

182 Gln Ala Glu Phe Asp Ile Leu Tyr Gly Gln Gly Ile Ser Arg Glu Gly

183 260 265 270

185 Gly Leu Ile Asp Met Gly Val Glu His Gly Phe Val Arg Lys Ala Gly

186 275 280 285

188 Ala Trp Tyr Thr Tyr Glu Gly Asp Gln Leu Gly Gln Gly Lys Glu Asn

189 290 295 300

191 Ala Arg Asn Phe Leu Lys Asp Asn Pro Asp Leu Ala Asn Glu Ile Glu

192 305 310 315 320

194 Lys Lys Ile Lys Glu Lys Leu Gly Val Gly Val Arg Pro Glu Glu Pro

195 325 330 335

197 Thr Ala Thr Glu Ser Gly Pro Asp Ala Ala Thr Ala Glu Ser Ala Pro

198 340 345 350

200 Ala Val Pro Ala Pro Ala Thr Ala Lys Val Thr Lys Ala Lys Ala Ala

201 355 360 365

203 Ala Ala Lys Ser

204 370




























207 <210> SEQ ID NO: 5

208 <211> LENGTH: 1125

209 <212> TYPE: DNA

210 <213> ORGANISM: Streptomyces lividans

212 <400> SEQUENCE: 5
213 atggcaggaa ccgaccgcga gaaggccctg gacgccgcgc tcgcacagat tgaacggcaa 60

214 ttcggcaagg gcgcggtcat gcgcatgggt gaccggacca acgagcccat cgaggtcatc 120

215 ccgaccgggt ctaccgcgct cgacgtggcc ctcggcgtcg gaggcatccc gcgtggccgt 180 

216 gtcgtggagg tctacggccc cgagtcctcg ggcaagacga ccctgaccct gcacgcggtg 240

217 gcgaacgcgc agaaggccgg cggccaggtc gcgttcgtgg acgccgagca cgccctcgac 300 

218 cccgagtacg cgaagaagct cggtgtcgac atcgacaacc tgatcctgtc ccagccggac 360 

219 aacggtgagc aggccctgga gatcgtggac atgctggtcc gctccggcgc cctcgacctc 420 

220 atcgtcatcg actccgtcgc cgcgctcgtc ccgcgcgcgg agatcgaggg cgagatgggc 480 

221 gacagccacg tcggtctgca ggcccggctg atgagccagg ccctgcggaa gatcaccagc 540 

222 gcgctcaacc agtccaagac caccgcgatc ttcatcaacc agctccgcga gaagatcggc 600 

223 gtgatgttcg gctccccgga gaccacgacc ggtggccggg cactgaagtt ctacgcctcg 660 

224 gtgcgactcg acatccggcg tatcgagacg ctgaaggacg gcaccgacgc ggtcggcaac 720 

225 cgcacccgcg tcaaggtggt caagaacaag gtcgcgccgc ccttcaagca ggccgagttc 780 

226 gacatcctct acggccaggg catcagccgc gagggcggtc tgatcgacat gggcgtggag 840 

227 aacggcttcg tccgcaaggc cggcgcctgg tacacgtacg agggcgacca gctcggtcag 900 

228 ggcaaggaga acgcgcgcaa cttcctgaag gacaaccccg acctggccaa cgagatcgag 960 

229 aagaagatca agcagaagct gggcgtcggc gtgcaccccg aggagtcggc caccgagccc 1020

230 ggcgcggacg ccgcctccgc cgccccggcc gacgccgcac cggcggtgcc cgcacccacg 1080 

231 accgccaagg ccaccaagtc caaggccgcg gcagccaaga gctga 1125 

234 <210> SEQ ID NO: 6 

235 <211> LENGTH: 1119 

236 <212> TYPE: DNA 

237 <213> ORGANISM: Streptomyces ambofaciens

239 <400> SEQUENCE: 6

240 atggcaggaa ccgaccgcga gaaggctctt gacgccgcac tcgcacagat tgaacggcag 60 

241 ttcggcaagg gcgcggtcat gcgcatgggc gaccggtcga aggagcccat cgaggtcatc 120

242 ccgaccgggt cgaccgcgct cgacgtggcc ctcggcgtcg gcggcctgcc gcgcggccgc 180 

243 gtcatcgagg tctacggtcc ggagtcctcc ggtaagacga ccctgaccct gcacgccgtg 240 

244 gcgaacgcgc agaaggccgg cggccaggtg gcgttcgtgg acgcggagca cgccctcgac 300

245 cccgagtacg cccagaagct cggcgtcgac atcgacaacc tgatcctgtc ccagccggac 360 

246 aacggtgagc aggccctgga gatcgtggac atgctggtcc gctccggcgc cctcgacctc 420 

247 atcgtcatcg actccgtcgc cgcgctcgtc ccgcgcgcgg agatcgaggg cgagatgggt 480 

248 gacagccacg tcggtctcca ggcccggctg atgagccagg cgctccggaa gatcaccagc 540 

249 gcgctcaacc agtccaagac caccgcgatc ttcatcaacc agctccgcga gaagatcggc 600 

250 gtcatgttcg gctccccgga gaccacgacc ggtggccggg cgctcaagtt ctacgcctcg 660 

251 gtgcgactcg acatccgacg catcgagacg ctcaaggacg gcaccgacgc ggtcggcaac 720 

252 cgcacgcgcg tcaaggtcgt caagaacaag gtcgcgccgc ccttcaagca ggccgagttc 780 

253 gacatcctct acggccaggg catcagccgc gagggcggcc tgatcgacat gggcgtggag 840 

254 cacggcttcg tccgcaaggc cggcgcctgg tacacgtacg agggcgacca gctcggccag 900 

255 ggcaaggaga acgcgcgcaa cttcctgaag gacaaccccg acctcgccaa cgagatcgag 960 

256 aagaagatca aggagaagct gggcgtcgga gtccgtcccg aggagccgac ggccaccgag 1020 

257 tccggaccgg acgccgcgac ggccgaatcc gcaccggcgg tgcccgcgcc cgcgaccgcc 1080 

258 aaggtcacca aggccaaggc cgcggcagcc aagagctga 1119

261 <210> SEQ ID NO: 7 

262 <211> LENGTH: 14 

263 <212> TYPE: DNA 

264 <213> ORGANISM: Artificial Sequence 

266 <220> FEATURE: 

267 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
268 oligonucleotide

270 <220> FEATURE:

271 <221> NAME/KEY: modified_base

272 <222> LOCATION: (6)..(14)

273 <223> OTHER INFORMATION: a, c, g, t, unknown or other 
275 <400> SEQUENCE: 7 

W--> 276 ggatgnnnnn nnnn 14 

279 <210> SEQ ID NO: 8 

280 <211> LENGTH: 18 

281 <212> TYPE: DNA 

282 <213> ORGANISM: Artificial Sequence 

284 <220> FEATURE: 

285 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

286 oligonucleotide 

288 <220> FEATURE: 

289 <221> NAME/KEY: modified_base

290 <222> LOCATION: (1)..(13)

291 <223> OTHER INFORMATION: a, c, g, t, unknown or other
293 <400> SEQUENCE: 8 
W--> 294 nnnnnnnnnn nnncatcc 18 

297 <210> SEQ ID NO: 9 

298 <211> LENGTH: 35 

299 <212> TYPE: DNA 

300 <213> ORGANISM: Artificial Sequence 

302 <220> FEATURE: 

303 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

304 oligonucleotide 

306 <400> SEQUENCE: 9 

307 ggcatatggc ggacctgtca aagctctccg acagt 35 

310 <210> SEQ ID NO: 10 

311 <211> LENGTH: 36 

312 <212> TYPE: DNA 

313 <213> ORGANISM: Artificial Sequence 

315 <220> FEATURE: 

316 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

317 oligonucleotide 
319 <400> SEQUENCE: 10 

320 ggtctagatc ccagccgcgg tcggtcggca gtcccg 36

323 <210> SEQ ID NO: 11 

324 <211> LENGTH: 53 
325 <212> TYPE: DNA

326 <213> ORGANISM: Artificial Sequence
328 <220> FEATURE:

329 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

330 oligonucleotide 

332 <400> SEQUENCE: 11 

333 ggtctagact cgctgttcca ccccgacccc acgcgctcgg gcaccgcgca cca 53 

336 <210> SEQ ID NO: 12 

337 <211> LENGTH: 48 
Please Note:

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.



Seq#:7; N Pos. 6
Seq#:8; N Pos. 1,2,3,4,5,6,7,8,9,10,11,12,13
L:11 M:256 W: Invalid Numeric Header Field, Wrong Current FILING DATE: YYYY-MM-DD
L:276 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7 after pos.:0
L:294 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8 after pos.:0